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The scope of the paper is to review the two major 
interest inventories, exploring the nuances and complexities of the 
technical aspects in their development, their item sampling, normingr 
scoring, reporting of results, and changing patterns of interests in 
relation to the differential treatment of sexes; and to suggest 
guidelines to eliminate or alleviate any potential sex biasing 
factors. Using the operational definition of sex bias as any factor 
that may influence a person to limit career opportunities solely on 
the basis of gender, these areas are explored and the following 
recommendations, among others, made: Interest inventories should not 
have separate forms of the instrument for males and females, and the 
same set of items should be used for both, with care that an item is 
not inherently more applicable to one gender than the other; in the 
development of externally based scales, every effort should be made 
to collect adequate samples of males and females for each occupation 
represented on the Inventory's reporting of results; and caution 
should be exercised against differentially including items that 
represent non-valid sex differences. (&J) 
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INTRODUCTION 

As part of the larger concern over various types of dlscrlmlna- 
tiOR and bias In the occupational world* the Impact that Interest 
Inventories have in guldinq people Into careers and vocations Is being 
studied. For some people, the choice of a vocation may be a raoot 
question; for example, th& classic case of the offspring who Itas knoiin 
from childhood that he or she will become heir to the parent^ business 
may have little choice in a -areer decision; or the person who 
is In the right placfi at th2 riciht tinift may have oppoi tunUli-s open 
up that no interest Inventory could predict or help in the decision of 
a lifetime career. 

But for the majority of people, their choice of a decision as to 
what field of vrork they should enter is perhaps as important a decision 
as whom they should marry and both have considerable Impact on 
tlieir lives. Professionally trained counselors continually help people 
who arc dissatisfied with their current jobs and desiiL- a change 
to something they will find more personally rewarding. Students who 
are graduating from high school are typically indecisive about what 
curriculum they should enter in college or in a business/technical 
school and thus seek guidance. In addition, more and more women are 
re-entering the occupational world and college and seek professional 
help in their pursuit of career opportunities; 

To help the counselors in their role of providing career-plan- 
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ning assistance to their clients » interest inventories have been re- 
lied upon to provide data on client's vocational Interest preferences 
and to help increase the probability that a chosen career or job will 
be a satisfying one. 

Typically, the major interest inventories have separated responses 
of males and females during the developmental stages and typically 
report different results for males and females. This differentiation 
of the sexes may create a limiting effect on the career options that 
are available to one sex or the other and thus sex biasing may be 
Introduced. 

A tentative operational definition of sex bias has been adapted 
by the National Institute of Education, Career Education Program plan- 
ning group as follows: 

"Within the context of career guidance, 
any factor that night influence a person 
to limit— or might cause others to limit 
—his or her consideration of a career solely 
the basis of gender." 
The scope of this paper will be to review the major interest 
inventories and to explore the nuances and complexities of the tech- 
nical aspects in the development of interest Inventories, their 
item sampling, norming, scoring, reporting of results, and changing 
patterns of interests In relation to the differential treatment of 
sexes and to suggest guidelines to eliminate or alleviate any poten- 
tial sex biasing factors. 

Major Interest Inventories- «a brief overview 

Currently, there are two major Interest Inventories that are 
used to provide results that are helpful In vocational guidance: 



1) the Kuder Occupational Interest Survey ^ and 2) the Strong Vocational 
Interest Blank* Both of these major inventories have handled thp roles 
that males and females have in the occupational world in slightly 
different ways. 

The Kuder Occupational Interest Survey (OIS) is intended for 
college-educated subjects (Kuder, 1966). It contains 100 items in a 
forced-choice format; for example, the subject is presented with a 
pattern of three possibilities, such as "Go to the movies", "Play 
cards", and "Go to a big party", and the subject is to pick one of 
three like most and one liked l^ast or disliked. Results indicate 
the similarity of the person's interest in relation to satisfied 
subjects in a variety of occupations and coUepe majors . 

About 60 occupational -based scales and 30 college-major based 
scales are reported for the inventory. Male and female subjects 
rospond to the same items. An earlier version of .the Kuuer, Form uu. 
the results for males were based on male criterion samples v^hile 
scores reported for females v/ere based on female criterion samples 
and for some selected male samples where there was not a female 
criterion sample. Thus, males and females were treated separately 
in scale development and reporting of the results.* 

Recently, the reporting of scores for the Kuder test has includ- 
ed all scores for all subjects (Kuder, 1974). Scores are reported 
for males based on male criterion groups and female criterion groups, 
and results for females are handled In an iidentical way. However, 
male and females are still treated separately during scale development. 

The Strong Vocational Interest Blank (SVIB). like the Kuder, is 
geared more for professional occupations than non-professional occu- 
pations (Campbell, 19C6; Campbell, 1971; Strong, 1943; Strong, 1959), 
Generally, each subject is asked to respond "Like", "Indifferent", or 
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"Oislike", to 400 items that cover a variety of areas such as occupa- 
tional titles, activities, and amusements. Currently, about G5 occu- 
pational scales are reported, such as Mathematician, 'Chemist, Liff> 
Insurance Sales. In addition, there are about 20 basic interest scales, 
or hwsogeneous types of scales that measure interest preferences in 
broader terms, such as ftechanlcal. Teaching, Sales, and Sports. (Thr 
exact nutjiber of scales reported depends on the sex of the subject). 
The SVIB treat- males and females distinctly front ti.c- very start v.ith 
a separate set of test Items for males and females. Although m:!ny 
items are identical betv-een the make and female foms, about ACC of the 
items are unique to each sex. 

Therefore, both the Kuder and Strong inventories may introduce 
a potential sex bias by a differential trcatniant of tlic sexes. The 
introduction of sex bias can occur at Several diffcjc.it Itag:;? ^ia 
development of an Interest inventory: 1) in actual test items thr;t 
appear on the inventory; 2) in construction of scaler-; and 3) in nonn- 
ing and reporting of results. 

TEST ITEflS 

The various forms of the Kuder invttntories have one form to be used 
by both males and females, and thus all subjects *c«re asked the sam 
set of questions. There is a non-differontial treatment of males 
and females at the item level. 

The SVIB, however, is unique In offering a separate test booklet 
and different Items for each sex. Although there is considerable 
overlap in the item content between the male and female versions of 
the SVID, more than 1/3 of the test items are unique to each sex. For 
example, the male version of the SVIB, form T399, asks the subject to 



respond "L1ke\ -Indifferent, or "Dislike", to the following occupa- 
tional possibilities: "Military Officer". "High School Principal". 
"Geologist", and "Public Relations Man", but the female version, TI«98. 
emits these items. Likewise, females are asked to respond to "Super- 
visor in Telephone Office". "Stewardess". "Receptionist", and "Fashion 
Model" and these itenis are not in the male version. 

There are both males and females gainfully employed in the above 
occupational exainples-there are female military officers and male 
fashion models and these Items should be acceptable to ask of either 
males or females. When different sets of items are asked of males 
and females, there may b? the speculation that some types of activities 
and vocations are more appropriate for one sex and not the other. 
Doubt may be raised about the appropriateness of females "liking" 
to be in the military or having mechanical interests, or of males 
"liking" to do cooking, or teaching children, or being a flight 
attendant. 

The SVIB has a long history of providing separate test forms of 
females and males. The first version of the SVIB was published in 
1927 and was intended to help provide vocational guidance for males; 
six years later in 1933 the SVIB for females was published. During 
the 1930s there were differing employment patterns for men and women, 
most of the trades and professions had a heavy representation of males 
such as mechanics and dentists while women dominated the secretarial 
positions and elementary school teaching postions. Two forms were 
kept in existence for the last 40 years, primarily because of the 
differing patterns of males and females in some occupations and 
because there was demonstrable differences in response -.patterns of 
men and women to the same item. 

The above factors sufficed to keep the SVIB male and female forms 



EJUC separated in the past but growing concern for equal rights and equal 



opportunities an^d concern over possible career limitations has becoine 
of primary Importance. 

Recommendation 

The : jowroended solution for avoiding any possibility of sex bias 
at the Item level Is to ask tbe same set of questions to both males and 
females. Special care should be taken so that the Items are phrased 
so that they are not inherently referring to one gender; for example* 
Police Officer would be preferable to Policeman or Pol1cev;oman, Realtor 
would be better than Real Estate Salesman, Sales Clerk Instead of Sales- 
lady. 

When gender can not be eliminate from the item, the item should 
Include both possibilities; for example, Dressmaker/Tailor, Waitress/ 
Waiter. Airline Flight Attendent would be better than Airline Steward/ 
Stewardess from sexual bias standpoint, but perhaps not as easily 
understood by high school students (the reading level and comprehension 
of the items also must be a consideration If the inventory Is to be 
applicable to high school students). 

Test developers should give serious consideration to requesting 
the U.S.-GSA Civil Rights Office to recommend changes in item content 
so as to preclude bias toward any minority group. 

While possible, with enough affort and care in writing items so 
that one gender Is not inherently favored, this is perhaps the easiest 
solution in avoiding sex bias during the development of an interest inven- 
tory, rtoles and females do show different base rates of respondinn 
to the same item and this is elaborated below. 



Hale-Female Item Response D1 fferences 

Male and females do differ In their base rate of responding to 
Interest Items. For example, when responses to the SVIB Item "Interior 
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Decorator" are 1n-^stt gated by gender, a substantial majority of women 
answer "Like" {67% of a sample of 1000 employed women representing an 
••average"). While, In contrast, a minority of men respond "Like" 
of an "average" male sample, N=1000). Consequently, when a male 
responds "Like" to the Item he Is giving an unusual response and actually 
Is responding in a manner similar to those of artists, actors, and 
architects. While a female responding "Like" to the sam^. item is 
indicating little that Is unique for her gender. 

Similar types of male-female differences are found for the Kuder. 
For example, in responding to the item triad of "Go see a fire", "Go 
to see an accident in which people have been hurt", "Go see a famous 
person riding along the street", 5755 of men-in-general Indicate they 
would rather go sec a fire in contrast to 33% of the women- In-general. 
While 60% of the wcMnen- in-general would prefer to see a famous person 
only 33:5 of the nen-in- general uculd prefer that cctivity over t\\s 
other two possibilities. 

As Indicated above, male-female differences do exist in their 
responses to the same item and as indicated below these differences are 
apparent during adolescence and are substantial . 

Results from Teen-agers 

Research on the differing patterning of Interests between males 
and females by Campbell (1974) indicates that such differences appear 
even among early teen-agers. The following table lists occupational 
items that a majority of 8th grade males on females "Like" from a 
list of 130. With exception of occupations "Cartoonist" and "Profess- 
ional Athlete", there is a definite difference in the responding of 
males and females even at this early age. 

Insert Table 1 About Here 



Tabic 1 

Popular Occupations Anong Male (ri«ai) an> Female (H«76) Ciqhth Graders 



Occupations marked 


"Like" by more than ono-half of tho bov 






Male. 


Fenalo 






Percentage 


Percfrntane 


Percc'U^n • 


Occupation 


"Like" 


"Liko" 


niff crone 


Auto Racer 


65:; 


2nr, 


41:; 


Jet Pilot 


57 


22 


35 


Cartoonist 


57 


61 


-n 


Professional Athlete 


53 


45 


G 


Inventor 


51 


17 


34 



Occupations n.irl-nd "Like" 


f'V nore tlmn onn- 


-hal f of tho nirls 






Fenale 


M-ile 






Percentage! 


Perec ntaqo 


Percentfl"* 


Occupati on 


"Like" 


"Lik«" 


Di ffcrence 


Children's Clothes Oes inner 


76:' 


14r, 


r>2r. 


Interior Decorator 


68 


21 


47 


Fashion Model 


66 


11 


55 


Costume Designer 


64 


13 


51 


Stev/ard/Stewardess 


64 


20 


44 


Actor/Actress 


63 


33 


30 


Home Economics Teacher 


61 


12 


40 


Cartoonist 


61 


57 


4 


Elementary Teacher 


61 


2G 


35 


Nurse's Aide/Orderly 


59 


10 


49 



id 
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Table 1 continued 

Manaser, Child Care Center 50 11 A7 

Dressmaker/Tailor 57 15 42 

Photographer 56 31 

Manager. Uoincn's Style Shop 55 10 4S 

Waiter/Viai tress 55 16 31 

Artist 54 35 19 

Typist 53 9 44 
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As Table 1 Indicates, there are considerable differences between 
young males and females. As they grow older, the magnitude of the 
differences does diminish somewhat, but never vanishes. The SVID ha^ had 
a Masculinity-Femininity (M-F) scale for many years to measure adult 
male versus adult female Interests based on Item responses, as have 
many personality tests such as the Minnesota Multiphasic Personality 
Inventory . (Welsh and Dahlstrom, 1956). and the California Psycholog- 
ica l Inventory (Gough. 1969). 

Results from Adults 

With development of a female version for the SVIB in the 1930s . 
Strong --'as able to measure the extent of alffcrent base rates of 
responding to Interest Items by males and females and developed M-F 
£pd F-!t scales to measure those differences (Strong. 19A3). Research 
by Johansson in 1969 (reported in Campbell, 1971), further refined 
these scales. 

Refined H-F and F-M scales were developed as part of a project 
studying male-female differences within occupation to the same item. 
When occupational membership was held constant, employed adult males 
ar females showed significant and practical differences. Fourteen 
occupations were available that had an adequate sempling (sample sizes 
about 250) of both males and females. All subjects indicated that 
they were satisfied with their jobs and all had bean employed in their 
occupation for at least three years. Each occupation was represented 
equally. Thus, the samples were very precisely defined and potentially 
spurious results from comparable random samples of males and females 
was lessened. 

This research was importa.nt In Investigating the extent of similar- 
ity or dissimilarity between an average composite of males and females 
when samples from the same occupations were compared. Investigation 
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of these male-female differences witbin each specific occupation leads 
to three tentative hypotheses (Casipbell, 1974): 

1. Men and women in the same occupation do not differ in their 
Interest preferences. 

2. Nen and women h^ive different Interests which are constant 
across all occupations. 

3. Men and women have different interests which are specific 
to each occupation. 

Table 2 below shov;s the numbar and percentaqo of itcn*: that 
showed large differences (15^S or greater) between males and females 
within each occupation. Clearly there are considerable differences 
between males and females even when occupational membership is control- 
led; about 30% of the items in question showed large differences. 
Items that were investigated wer*a the 229 items that were overlapping 
items between the male and female forms of the SVTB. 



Insert Table 2 About Here 



Those items that showed significant differences betv/een the sexes 
were aggregated into the refined Masculinity-Femininity scale. Inter- 
ests that v/ere more typical of males than females included outdoor, 
adventuresome types of activities, and business interests. Interests 
more typical of females involved a liking for cultural activities, 
interest in fine arts, music, art, literature, and so forth. 

To Investigate whether differences are constant or unique to each 
occupation, the follov/ing table lists all items that showed very large 
differences (25% or greater) between the SVIB men -in- general (MIG; N« 
1000) and women-in-general samples (WIG; N-1000). For the first 14 
Items, females responded "Like" more frequently than males; for the 
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number of Itens of 229 That Shav "Like" or "Dislike" Rnsponse. 

Differences of 15:; or Greater Bctvieen Males and Fenales 

Male-Female Differences 

nccn!> nti on Nt.- nher of Itcr;s Percontan? 

Artists 60 26?; 

Bank Personnel 07 

Chemists 57 25/- 

English Teachers 73 3?r. 

Interior Decorators 47 20?: 

La\/ycrs 67 29',^ 

Life Insurance Sales 01 3^" 

Mathematicians 19 21" 

riath-Scionco Tcaciiers 93 41/ 

tlcdical Teclir.Dlo()ists 55 24^ 

News Reporters 76 33*;'. 

Physicians 61 27'', 

Psychol onists 47 2r. 

Social Science Teachers 96 425i 

Averaoe 69 30% 
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last 5 items, male responded "Like" more frequently. Also, the differ- 
ences between the percentage of males and females for the 14 occupations 
who responded "Like" to each item appear as the table entries. For 
example, 612 more of the females in the in-general sample prefer to 
"Decorate a room with flowers" than do males (752 of the viomen versus 
142 of the men). The next entry indicates the difference befc/een male 
and female artists, next for bank personnel, and so forth. The last 
entry indicates the average across the 14 occupational samples. (Camp- 
bell, 1974). 



Insert Table 3 About Here 



Inspection of the first and last columns indicates the entries are 
very similar, differing by only one or two percentage points. Thus, 
as the first voluimi IriuiCdleii, ii>en dnii wuiikmi in a 'general" sample do 
have different response patterns to the same item and the last column 
indicates that these differences are fairly constant between males 
and females even when occupational membership is held constant. 

The above illustrated items that shov/ed very large differences 
between the sexes; approximately one-half of the 325 items on the 
revised SVIB (that will be released in 197^) indicate differences 
of 15% or more betv/een males and females comprising the in-general 
samples. (The increase over the 302 for the 229 in Tabic 2 is a result 
of including items relating to domestic, mechanical, and military types 
of activities that previously appeared on only one of the SVIB forms 
and these types of activities show large differences). 

Thus, even though good Items can be written for males and females 
to alleviate potential sex bias* by not inherently referring to one 
<3 gender; different response patterns by sex are a concern for scale 
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constructlon and reporting of results. 

INTEREST INVENTORY SCALES 

Another stage at which sex bias may become apparent is during the 
construction and norming of scales for an interest inventory. Two main 
types of scales frequently are developed: 1) criterion scales whei-e a 
person's interest preferences arc associatsd with those in a criterion 
sample such as an occupational sample of artists or college students 
majoring in biological sciences; and 2) homogeneous or basic interest 
scales that measure the underlying interest dimensions of the inventory 
and a person's interest preferences are related to those in a general 
reference sample. Each has its unique advantages in the type of 
information that it captures from items on the inventory and each 
differs in the way that potential sex bias may be introduced. 

Criterio n Scales 

Strong 

The development of criterion or occupational scales for the 
SVIB involves the contrasting of item responses of employed people 
in a specific occupation (criterion sample) with a sample representing 
a population "average" (in-general sample). Each occupational scale 
contains those items that significantly differentiate criterion sample 
from in-general sample. There are various methodological considerations 
and problems that have to be considered in collecting a criterion 
sample and defining an in-general -sample (see Campbell, 1971; Clark 
and Campbell, 1965; Strong, 1943), but for purposes of this paper the 
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assumption Is that adequate samples are available. 

Therefore, after Item responses for the criterion sample and 
In-general sample have been collected, item response percentaqes for 
both groups are calculated and contrasted with each other. Items that 
show significant percentage differences (usually 15^-20%) between the 
two groups are included in the occupational scale. Response patterns 
(for example, Like, Indifferent, or Dislike) for these significant items 
typically are assigned weights based on the magnitude and direction of 
differences bstween criterion sample and in-general sample. The result- 
ant scale is then normed on the criterion sample used for scale devel- 
opment. 

To illustrate, the folloi-<ing example shows the response percent- 
ages of male psychologists and the male in-general sample for the two 
items "Author of Novel" and "Employment Manager." 

"Author of Novel" 
Male Response Response 

Psychologists Men-in-General Differences Weights 

Response (N=252) (N»1000) 

Like 8U 5U 30% +1 

Indifferent 15X 30% -15% -1 

Dislike 4% V% -15% -1 

"Employment Manager" 
Like 35% 29% 6% 0 

Indifferent 34% 41% -7% 0 

Dislike 31% 30% 1% 0 

As illustrated In the above examples, responses are empirically 
unit weighted if response difference; Indicate that there Is a signif- 
icantly large difference between criterion sample and In-general sanqile; 
responses are not weighted If the difference Is small. [After years 
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of research with the SVIB, a 15!6 difference or greater is considered 
Urge and meaningful; Campbell, 19712. Thus, if a person responds "Like" 
to the item "Author of a Novel" his or her raw score on the psychologist 
scale would be incremented by one, if the response was "Indifferent" or 
"Dislike", the raw score viould be decremented by one. Responses to the 
item "Employment Manager" would have no bearing on tlie score for the 
psychologist scale, but it would be relevant for some other scales such 
as the sales mamagcr scale. 

The number of items that differentiate an occupational sample from 
the in-general sample on the SVIB is usually between 70-90 itens or 
about one- fourth of the test. A person's raw score on each scale dev- 
eloped for the inventory is the sum of plus and minus unit weights 
that correspond to his or her unique pattern of responses for each scale. 
Th2 raw score is then converted to a standard sccrd using a raw-score- 
to standard-score conversion formula. Each person receives a standard 
score for each of the occupational scales developed for the inventory 
indicating the degree of similarity betwee his or her interest pref- 
erences and those in the cri'terion sample based on those items that 
were more characteristic ol the criterion sample than the in-general' 
sample. 
Kuder 

The Kuder CIS uses a somewhat different approach than the SVIB in 
the development of empirical criterion scales for the inventory. 
Because of difficulties found by Campbell and Strong in developing an 
adequate in-general sample, Kuder (1956) decided to dfrcunwent the 
methodological problems by use of a lambda coefficient which is similar 
in concept to biserial correlation coefficient. Lanfcda coefficient 
expresses the degree of similarity (correlation) between a subject's 
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responses and those members of the criterion group, such .s >n occupa- 
tional sample. This procedure effectively eliminates the need for an 
in-general sample. The upper 11»lt of lambda is 1.00 indicating 
con,plete similarity with nssponses of those in the criterion sample, a 
lambda of .00 indicates no similarity; these coefficients are used 
as the scores for "--T'^'""' '"t^^^t Surve..^. Because of the 

method of scale construction, noting and standard score transforations 
are not a concern as with the SVIB. 



Gender at. o Factor 

Slice the SVIB traditionally has separated the response patterns 
of .ales and females at the item level, so too are the responses separated 
at tne scale development level. Male occupational samples are compared 
with a r>alc in-general sample representing the "average" employed male. 

u.cu.oLio„.l saK>Us are ccparcd ■..•ith = fcr.al= in-g.ner,l 
sa,.l>le ro,>rcsont1ng the "average" employed female. Thus, It is not 
surprisinu to find that If a person completes both the male and female 
SVIB. different scores will be obtained for the same-named scale on both 
inventories; for example, a persor may obtain a score of « on the male 
chemist scale and 10 on the female chemist scale. Item content of the 
1ike-nanK=^ scales are different, criterion samples are composed of different 
people, and the (n-gcneral samples are different. 

The Kuder also separates the sexes during scale development. Some 
occupations such as Lawyers and Computer Progra,mers have separate and 
distinct scales for males and females. Other criterion scales such as 
Dean of Uo.aen cr Plumber are based on just one sex. Currently, a subject 
is scored on all scales regardless of sex and the same problem arises as 
with the two forms of the SVIB-where two scales exist for an occupation. 
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one based on females and one on males, discrepant scores result. 

The sex bias concern is tiiat there is not complete comparability 
in the scales that are developed for male and female form*; of the SVin 
and scales developed by qender for the Kuder. The female SVIB contains 
scales that measure interest preferences for various nursing occupations, 
airline stev/ardesses, entertainers, and telephone operators. However, 
the male SVIC does not give an indication of thcso specific occupational 
interests --there are adequate samples available for male airline stev/.-.rris, 
male nurses, nale entertainers, and male telephone operators (SchlossSern 
and Goodman. 1972). The male SVIB has a wider sampling of pr ofessional 
occupations r.uch as biologists, architects, psychiatrists, and physicists 
but these arc not available on the female SVIB. A similar sot of cir- 
cumstances exists for the Kuder. 

For someone concerned about sex bias in interest wasurement, 
separation by gender wh2n scales are developed and reporlcJ is a salit-nL 
focal point. For example, are interests measured by the male form of the 
SVIB different in scope than those measured by tlie female form? How 
important are these differences? If a woman wants the results of scales 
appearing on just the male form then there is the problem of hcsi qenerc-ilizable 
are the results from male criterion groups to her interest preferences? This 
is also true of the Kuder, but it is perhaps no.v more apparent. 

Subjects completing the Kuder arc now presented v/ith two scores 
for some occupations; with the SVIB, the subject had to take two tests 
to produce differing results. The follov/ing section investigates occupa- 
tional scale characteristics when they are constructed on samples sepa-* 
rated by gender. 
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Sexual Stereotypes In Criterion Scales 



The relative importance of sex differentiating items in like- 
named scales developed on male criterion san^les and female criterion 
samples is contained in research by Johansson and Harmon (1972) on the 
SVIB. The study specifically investigated the following areas: l)whether 
men and women in the same occupation have different interests; 2) whether 
these differences arc reflected in occupational scales for men and women; 
and 3) whether these occupational -scale differences are valid and useful. 

Fourteen occupational samples and the in-general samples were used 
for analysis as discussed previously (see pages 10-11). The following 
figure presents a paradigm of how item-response differences were analyzed 
using male and female criterion samples and their relationships to the 
in-general samples. 



Male 

Cri terion 
Sample 



Female 

Criterion 

Sample 



rten-In- 
General 
Sample 



Women-In- 

General 

Sample 



The male occupational scale Is based only on differences between 
the male criterion sample and men-in-general (Difference a). The 
female scale is based only on differences betv/een the female criterion 
sample and women-in-general (Difference b). Difference c represents 
Items that differentiate male-female criterion samples and exanples of 
this have been given previously In Tible 1. For those items that show 
response differences between men and women In an occupation (Difference c)» 
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a given item can have four possible effects on male and female 
scales. 

1, Malo-female differences are not incorporated in the scale. 

Despite a largo item-response difference (for purposes of 
this study, large was defined as 12X or greater) betv/onn males and 
females (Difference c), the item appears on both scales because 
differences between occupational and in -general groups (Differences 
a and b) are also large. Thus» sex difference is not differentially 
incorporated into the scales* The following exemplifies this con- 
dition where there are large response differences to the item 
"Electronics Technician" between male and female Medical Tech- 
nologists, between in-general samples, and betv/een criterion 
samples and same sex in-general samples. Even though the item if. 
a sexually stereotypic item (a larne difference bet>/een males and 
female?) ♦ it is weighted th?» samo for both male and female scales, 
and docs not produce a sex bias in the scale content. 

Non-Influential Male-Female Differences 
Reponse Percentages to the Item: 
"Electronics Technician" 



Response 


Medical Technologists 
Male Female Differences 


In-General Samples 
Men Women Differences 


Response Weights 
Hale Female 
Scale Scale 


Like 


55% 


35% 


20% 


22% 11% 


11% 


+1 


♦1 


Indifferent 


35% 


36% 


1% 


42% 30% 


12% 


0 


0 


Dislike 


10% 


29% 


-19% 


36% 59% 


-23% 


-1 


-1 
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Across all fourteen occupational samples, approximately one- 
fifth of the items differentiated males from females in the criterion 
samples and were non-influential because they were weighted identically 
on the male and female scales. 
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2. Valid male-feimle differences are Incorporated In the scales. 

While there Is a large difference betvceen the criterion groups 
(Difference c), only one of the differences between the 
criterion groups and tbe same sex in-general sample (Difference a 
or b ) Is also large i thus the Item appears on only one of the 
occupational scales. In this situation. Difference d in the para- 
digm Is small. Example 2 below shows that responses to the item 
"Computer Operator" are weighted on the male scale for Medical 
Technologists but not for the female scale. Differences between male 
and female Medical Technologists are large but this di-ffe^ence is 
specific to the occupation since the in-general samples show a much 
smaller difference. 



Valid iidle-Female Differences 
Response Percentages to the Item: 
"Computer Operator" 



Response 


Medical Technologists 
Male Female Differences 


In-General Samples 
flen Women Differences 


Response Weights 
Male Female 
Scale Scale 


Like 


3U 


28X 


3« 


14% 22% 


-8« 


♦1 


0 


Indifferent 


512 


34X 


17S 


37% 29r, 


8% 


♦1 


0 


Dislike 


18S 


38% 


-20% 


49% 49^5 


0% 


-1 


0 



The above Item Illustrates the Inclusion in scales of a valid 
or legitimate sex difference in interests between males and females 
In the same occupation. Averaged across all 14 occupational samples, 
approximately m of the scale items fell Into this category. 
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3. Non-valid male-female differerttiatinn items incorporated in the scales. 
Items that differentiated the two in-general samples (Difference 
d) were regarded as sexually stereotypic since it was true of all males 
and females and for males and females in the criterion samples. When 
such a sexually stereotypic item appeared on only one scale, then it 
seemed to be spuriously related to the magnitude on the difference 
betv/een criterion sample and in-general Sdinple. The example below 
shows this relationship for the item "Art Galleries." tiales and females 
in the criterion sanples and in-neneral samples show largo differences 
but the item is only weighted on the male psychologist scale. 



Non-valid Hale-Fem^ile Differentiating Item 
Response Percentages to the Item: 
"Art Galleries" 

Psychologists In-General Samples R*»^non?;e Weiqht^ 

Hale Female 

Response Male Female Differences Men Women Differences Scale Scale 
Like 66« 85?; -205; 42-:: S6% -24% +1 0 

Indifferent 25% IIX 14^5 39% 24?I 15% -1 0 

Dislike 9% 3% 6% 19% 10% 9% -1 0 

While items as exemplified above did represent significant differ- 
ences between males and females in the criterion samples and the in- 
general samples, the items were related to sexual stereotypes that had 
been included fortuitously in the scale for one of the sexes. Fortu- 
nately, only a few items of this type are included in the scales. 
4. Non-valid in-general differentiating items incorporated in the stales. 
Again, differences betiveen the in-general samples (Difference d) 
were considered sexually stereotypic. If the item showed a small 
difference between the criterion samples (Difference c), but was 
included in one of the scales because a large difference betvreen a 



crlttHon sample and in-general sample (Difference a or b). another type of 
invalid difference had been included- in the scales. As illustrated 
below, nile and female psychologists respond similarily to the item 
"Inventor" but the in-general samples show large differences. The 
item 1$ weighted on the female psychologists scale but not on the male 
scale. The male and female scales differ, not because people in the 
occupation differ, but because in-gcnerr,l samples differ. 



Kon-Valid In-General Differentiating Items 
Response Percentages to tho Item: 
"Inventor" 

Psychologists In-General Sanplos Response V.'cinhts 

M?le Fenale 



Response 


Male 


Female 


Di fferences 


rien 


Women 


Di fferences 


Scale 


Scal( 


Like 


73« 


7U 


2% 




44% 


18% 


0 


♦ i 


Indifferent 


22% 


22% 


0% 








0 


0 


Dislike 


St 


7% 


-2% 




21^ 


-ll^'. 


0 


-1 



These last tv/o types of differences {^3 and *'4), influencing 
one scale and not the other, did not represent d valid inclusion of 
the Item In the scale for one sex and not for the other. Across all 
14 occupational scales that were studied, about 18*4 of the items on 
the scales were these types of Items that represented the inclusion 
of non-valid sex differences in the scales. 

In sunwary, the study showed more than 70% of the items on the SVIU 
scales studied did not differentially Incorporate sexually stereo- 
typic Items Into male and female scales. Of the remaining Items on 
the scale, about 10!5 were items reflecting valid sex differences and 
less than 2055 were invalid Inclusions. Thus, depending on one's 
point of view, 70% can be taken as the over-riding factor with little 
worry about sex bias or 20% can be glared at as justification for 
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change . 

Solutions to the Problem 

As the preceding data have indicated, there were differential t'ase 
rates of responding for the sexes to individual itens on an interest 
inventory. As will be shown later, these differences have remained 
stable throufjh the late 1960s and probably will be apparent for years 
to come. These differences can not be ignored and become a vexing 
problem v-»hen occupational scales are developed. There are four 
pr.ssible strateciies one can use in handling this problen as outlined 
by Campbell (1974): 

1. A simple solution would be to ignore sex and randomly collect 
criterion samples and let the proportions of males and females 
in the sample match the population split. There would be two 
attendant disadVi-'ntages to this approach. One, some occupations 
still have a low percentage of one sex in the occupation, for 
example, female carpenters, and collecting a tryly random sample 
with sufficient representation of both sexes v/ould be impossihl-'. 
Iwo, if occupational scales are constructed by contrasting a 
criterion group with an "average" sample, such as an in-general 
sample, the composition of the in-general sample would over- 
represent those occupations where there is a preponderance of 
one of the sexes. 

2. A second possibility would be to use an equal representation oT 

males and females in the composition of criterion .groups, again 

developing just one scale for both sexes. As in tlie previous 

alternative, adequate samples of both sexes are extremely 

difficult to coUeot and almost impossible for some occupations. 

Where there are sufficient members of an occupation, male response 

o preferences could be statistically weighted equally with female 
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response preferences. !iowever, for soma occupations, a low rep- 
resentation of one sex would lead to instability in the comhined 
responses for the total firoup. This proceiluro v.ould vMi.H-c"it 
items that are valid for one sex but not for the other sex. for 
example, "Decorating a room v;ith flcv/ors" is moro important if a 
male responds "Like" than if a female responds "Like". 

3. A third possibilty would bo to develop separate scales for iiiales 
and females and then to equate scores throunh a statistical 
formula appropriate for each occupation. This would be cnalonous 
to procedures used be some graduate schools in applying differential 
weights to grade-point averanes obtained at vorious undergraduate 
institutions, with "h'^avier weinlits" corresponding to the more 
academically "hard" institutions. However, considerable research 
would have to be done on such modifications of scores to invcstinitir 
tne impact of such a {.n i>ueJuio . T;.c on.i ; csi;U prrhcblv v.'O'jld 

be the same as the second alternative above. 

4. A fourth possibility v.ould be to develop separate scales for males 
and females as is currently done for the Kudor and 5VIB. This is 
the easiest to do from a developmental standpoint and yields the bfst 
predictive and concurrent validity for the inventory. Studies by 
Berdie (19G1), Dunnette and Kirchner (1^58), Frodcriksen, Melville, 
and Gilbert (1954,19-0), Ghiselli (1%3), Grooms and Endler (1%0), 
Johnson and Johnnssm. : .72), and Seashore (1961), have shown the 
efficacy of moderator variables in increasing the validity of 
measurement. Using sex as a moderator variablf? increases the 
validity of scales by including those items that are most differ- 
entiating for each sex. 

Recommendation 

o Collection of occupational samples is a costly venture, but the 
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demonstrated predictive validity of empirical scales developed on 
criterion samples makes the utility of thfis approach worthwile. Where- 
ever feasible, a concerted effort should be made by the test developer 
to collect adequate samples of each sex for an occupation. • Construction 
of separate scales for males and females Is technically sound from the 
standpoint of concurrent validity. However, it is recommended that 
greater care be given so that non-valid sex differentiating items 
(as outlined in pages 24-26) are not incorporated into thr> scales. 

Use of Criterion Scales Devel oped on the Opposite Sex 

Currently, scores are. reported for subjects based on opposite-sex 
criterion samples for the Kjder and for the SVIB if the opposite-sex 
test is administered. This procedure will continue until : adequate 
representation of both males and females in all occupations is accom- 
plished. The utility of this procedure has been explored in various 
research studies. 

Research by Cole (1973) on female interests and Cole and Hanson 
(1971) on male interests suggests that there are similarities between 
occupational configurations of women's interests and configurations 
of men's interests using data on the SVIB, Kuder, and two other inven- 
tories. These configurations could be useful in providing additional 
information about career opportunities for males and females even 
though there are not relevant specific scales on the inventory. The 
aboye research indicates that there Is enough similarity in interest 
structure between the sexes that generalizations beyond the status 
quo of an inventory are possible in exploring new career opportunities 
for males and females. While these findings are important In under- 
standing configuration of interests, test users will find occupational 
scores the easiest to use and there still Is the problem of reporting 
scores for females based on male criterion groups and reporting 



scores for males based on female criterion groups. 

Applicability of occupational scales developed on male criterion 
groups for females taking the Kuder OIS has been studied by Hornodary and 
Kuder (1961). They found that scales differentiated for women as v/ell 
as they did for males for nine of ten scales studied. In a similar study 
Kuder (1956) found hiijh median correlations between scores based on 
male criterion sanples and feuialc criterion samples for three snnplos 
of women. Kuder concluded that rcportino of scores for female*? h-'sed on 
male criterion samples was a valid procedure for reproscntinfi their in 
fields v/here there are opportunities for \/omen but criterion data v/as not 
available. Kuder (1966) also stated that if a v/oman enters an occupation 
dominated by males, she will find greater satisfaction if her interests 
resemble those of the males in the occupation. 

More recently, Kuder (1974) stresses the importance of givinq more 
emph-isis to scores based on titc subject 'r. o.,n sex and uses scorc-^ based 
on the opposite-sex criterion scales to nive added insinht into a subject's 
interests. If the person scores hifihest on several of opposite-sex 
criterion scales, these scores may indicate good possibilities for further 
exploration if not represented by same-sex scales. 

Oarley and Hagenah (1955) found that using botfi forms of the SVID to 
be beneficial for women w|jo had a high degree of maturity, ability, and 
career motivation. Strong (1959) suggested that the male form could be 
used for women who have interests (via the M-F scale) and career 
aspirations similar to those of males. Thus there is some evidence on 
tlie efficacy of reportino scores for scales based on criterion samples 
that are of the opposite sex of test taker. 

A study by Laime and Zytowski (1964) also indicated that for women 
who took both forms of the SVIB, their score on several males scales 
could be predicted from corresponding female scales. In 



addition, for mare professionally oriented females, tbey tended to 
receive higher scores on certain scales of the male form than on the 
female form. 

Huth (1973) cites criticisms vf the SVIB as it pertains to 
raeasurenient of female interests. She generally concluded as did Super 
and Crites (1962) ten years earlier, tliat the female SVIB does not 
show good differentiation of interests for the majority of wotnen. 
There seemed to he a contmonuus of interests among wornen that makes 
differentiation difficult except in those cases v;herc women have a 
clear-cut interest. The nenernl pattern seems to he a "hore versus 
cai-eer" orientation. Thus, if women have had a strcnq caref-r orienta- 
tion, the male SVIR has been ir.ed froqMcntly to provide the necessary 
differentiation of occupational interests. 

A more extensive analysis of the sane type of da^a was done by 
C;nnnhr>n {1974). Ilninn only it«»n«; thft vforo ccv^'^n to m=»l'» ?n'< 
female SVIQ, 31 male and 31 female occupational scales we»'e developed 
and nonned on appropriate-sex norm groups. For example, a ma3e 
Physica'i Therapist scale was developed and nonned on male physical 
therapists; likewise, a female Physical Therapist scale was developed 
and normed on female phy«;ical therapists. Next, a general sample of 
200 males and 201 females v/?re scored on the 62 scales (31 male-based 
scales and 31 female -based scales), correlations wore computed between 
the two scales for each occupation and rioan differences in score 
value were derived. Table 4 presents the results of this study. 



Insert Table 4 About Here 



The median correlation between same-named scales for the SVIB 
reported in Table 4 (r«.76 and .77) was equal to the magnitude obtained 
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BEST COW ^VAIUBIX 
by similar studies on the Kuder OIS (Kuder, 1966). The magnitude 
of the correlations would Indicate that there Is a great deal of similar- 
ity betv/een scales developed po male criterion groups and female 
criterion groups, but not sufficieot similarity to interchange scales 
carte blanche. 

Hales in the general sample were scored on both sets of scales 
for each occupation as were the females. The moan difference column 
entries are important in investigating the scores for females on scales 
developed on Por.ale criterion samples. Of importance is the direction 
of the difference betv/een the male scale means and female scale means. 
Since the means are for random samples of males and females, low scores 
on occupational scales are desireable; negative differences indicate 
that appropriate-sex scales are superior and positive differences 
indicate the obverse. The results indicate that scales based on male 
criterion group«i work hpist for male?:, pnd scales f'?*?^' on fetr^ale 
criterion groups v/ork best for females. 

The average absolute difference between the two scales for each 
occupation in Table 4 was about one-half standard deviation(5.5 standard 
score units for males and 6.7 for females). If there were no overall 
differences in scales developed on male and female criterion samples, 
then the difference would be about zero. Tbis clearly was not the 
case, mean differences of the magnitude of one-half standard deviation 
on SVIB occupational scales arc both statistical and meaningful 
di f ferences . 

In investigating item composition of the above scales, Campbell 
found that scales that contained sexual stereotypic items influenced 
the magnitude of the score differential between males and females. 
Hales tended to score highest on female scales dominated by "male" 
types of items, such as the female scales for Amiy Non-Commissioned 



Officer and Army Officer, than on the corresponding male scale, while 
females scored highest on male scales dominated by "female" types 
of items, such as the male scale for College Professor, Musician, 
and English Teacher. 

Rcconsncndation 

So as not to limit career opportunities avail?.ble to males and 
females, it is recommGnded that scores be reported for all sc«los 
available and v;hen possible based on the approrriato sex. l!i»on score!, 
bcscd on the opposite sex are reported. It is imperative that test 
users be made cogently aware of this. When these opposite-sex 
scores arc used, they should be Interpreted with sexual stereotypes, 
and their potential effect on scores, kept in mind. 

INTERNALLY RASKD SCALES 

The second major type of interest scale developed for interest in- 
ventories is based on internally related items and labeled as homonencous 
scales or basic interest scales. Generally, each Inventory, throuah 
its item content, covers a range of basic interest dimensions and scaVrs 
can be constructed through statistical procedures to measure those 
dimensions, for example. Mechanical interests. Social Service interests, 
and Numerical interests. Unlike the occupational scales where the 
number of scales developed depends on occupations that test 
developers are willing to spend time and money to test, the homoQenous 
scales are internally-based and developed to tap all interest domains 
of the inventory.. The male SVIB has 71 homogenous scales and the 
female SVIB has 19. The Kuder OIS does not contain these types of 
scales, but a version Intended for junior high school students, Kuder 
General Interest Survey, has 10 such scales. 

ERIC 



Scale Development 

Typically, a general sainple is tested v/lth the inventory and 
product-moment correlation coefficients are computed for all pain/ise 
combinations of items by assigning numeric v/eights to response patterns. 
Then using factor analsis or cluster analysis, highly interrelated 
items are aggregated into a scale and a descriptive nane applied that 
reflects some connon psychological theine that is beifj^ measured by 
the Individual items. 

For example, the item responses of "Like", "Indifferent'.', and 
"Dislike", on the SVIB v/ere assigned weights of +1,0,-1 respectively 
and then all pair-wise item intercorrelations v/ere computed usinfi 
the responses of a general sample for the male form, nnd a general 
sample for the female form. The example bolwi shoivs a correlation 
of r-.63 between the items "Algebra" and "Arithmetic" based on the 
responses of 500 males. (Ine table entries are response percentafje*: 
of a general sample of employed males to tv/o itpms considered as a 
pair; for example, 51?; of the subjects responded "Like" to both "Algebra" 
and "Arithmetic"). 

"Algrbra" 



"Arithmetic" 





Like 


Indifferent 


Dislike 


Total 


Like 


51% 


m 


6% 


69% 


Indifferent 


3% 


11% 


6% 


19% 


Dislike 


1% 


1% 


10% 


12% 


Total 


55% 


24% 


22% 


100% 
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These two items, "Algebra" and "Arithmetic", in addition to the 
items "Calculus", "Geometry", "Mathematics", and "Physics" appear 
on the basic interest scale MATHEMATICS for the male SVIB, The same 
procedure was used for the SVIB female basic interest scales using 
a general sample of female responses. As with the occupational scales 



the person's raw score on each scale Is the sun of the response 
weights for the items included on the scale. 

Again » because the SVIB has used separate sets of items for males 
and females, separate basic interest scales have resulted with many 
overlapping concepts, but still some unique scales appear. For 
example, males are not scored on a DOMESTIC/HOMEMAKinri scale nor 
are females scored on a MILITARY ACTIVITIES scale. (This uniqueness 
is a direct result of not having sufficient appropriate item contents 
on the inventory.) The previous reconimenddtion that the same set 
of items be given to both sexes would eliminate this uniqueness in 
Qcnerction of results. 

The ten homogenous scales for the secondary school version of the 
Kuder has identical homogenous scales for males and females and 
permits meaningful comparison between the sexes on their interest 
pr^'forencec. Usincj the sar:c set of itnrr.s for all suwjects, the 
Kuder has overcome some of the objections of the SVIB. 

Use of intercorrelations as the basis for scale development 
circumvents many problems associated with development of occupational 
scales. If the same set of items is administered to both males and 
females, it is reconnended that a general sample with equal represent- 
ation of both sexes be used in the construction of these types of 
scales. The predictive validity of intemally-basod scales is 
of a relatively lesser importance than for criterion scales and 
differential rates of responding for m^les and females are not a 
real problem until scales are normed. 

Norming of Internal ly»Based Scales 

The reference sample used for norming homogeneous scales is 
determined by the test developer and generally is influenced by which 
^ reference sample would be most relevant for Interpreting the results. 



ERIC 



If a general reference sample Is used where males and females are 
equally represented* there are two alternatives to normlng of scale 
and reporting of results. 

One alternative would be to split the general reference sample 
or other reference sample by gender and provide separate norms for each 
sex so that the average male sample and average female sample Would 
have Identical means and standard deviations. Sudi statistical manlp- 
ulation to achieve equality of score distributions would actually 
disguise the underlying male-female differences. For example, a rav/ score 
of 10 on the ART scale may convert to a standard score of 50 based 
on a male reference sample but the same raw score of 10 for a female 
may convert to a standard score of 45. The end result Is that the same 
degree of Interest v/ould produce different standard scores. (The 
separation of the sexes when constructing empirical occupational 
scales can be faultofj by thf» same arniini'>nt. Hn«.;*»v.''r» tH* in**»*»rcrr^l?- 
tlonal procedure permits the combining of male and female responses 
without "washing out" differential response rate^ and ^Ije predictive 
validity of homogeneous scales in not an overriding concern. Both 
these factors make the use of a combined male-female sample more 
acceptable for internally-based scales.) 

The second alternative is to develop one scale for both males and 
females using tjie general reference sample, norni the scale on the 
male-female composite, and provide normative data separated by 
gender when results are reported. This procedure would convert a raw 
score to the same standard scores regardless of gender, but interpre- 
tive norms would provide necessary information about differences In 
male-female score distributions. This procedure, illustrated below 
is similar to that for reporting scores on the Comparative Guidance 
Placement Ptogram of the College Entrance Examination Doard. 



Recommendation 

The following example Illustrates reportlncj of scores on two 
internally-based types of scales, Mathnatics and Physical Science 
using male-female composite for norroing purposes but the interpretive 
data are split by gender. Standard scores are plotted and easy 
reference to both male and female distributions is possible. Explan- 
atory statements can be preprinted on the profile explaining that 
the white bar represents the nonnal ranne of male scores and tl»c; 
speckled bar roprsents the normal range of female scores \:ith vertical liiuis 
representing the average. Good graphic representation could also 
indicate qyartiles or 10th and 90th percentiles. 

Example of Homoneneous Scale Output 

Scale Score 0 20 VJ.'"^^ 

Kithcr.atics 55 ^ 

Physical Science 51 v i .. .. ' r^t 

This procedure also v/ould have the attendant advantage of provid- 
ing a more meaningful linkage to scores of male and female occupational 
samples. Since a standard score would represent an equivalent degree 
of interest to items on the scale regardless of gender, resultant 
score presentations for occupations (identified by sex) v/ould give 
meaningful cotnparisons betiveen males and females in the same 
occupation. 

The above recommended procedure uses the same items for both 
males and females, yields identical standard scores from the same 
expressed interests on homogeneous scales regardless of gender, but 
takes into account differential response frequencies for each sex 
vihen scores arc related to normative data. 

ERIC 



Sex Differences 

Table 5 shows mean raw score differences betv/een niales and females 
on the homogeneous -type scores for the Kuder General Interest Survey 
(Kuder,1973). All differences between the sexes were statistically 
significant. Males scored much higher on the Mechanical scale 
(more than 19 points) than females, while females had more intense in- 
terests ^^^an males in Social Service. Generally, the results 
pattern sexually stereotypic roles, males scoring higher on Outdoor, 
Mechanical, Computational, Scientific, and Persuasive, while fensalcs 
v/cre higher than males on Artistic, Literary. Musical, Social Service, 
and Clerical. 



Insert Table 5 About Here 



CHANGE OF INTERf.STS OVLR TItlE 

Do interest preferences change over time? For individuals, there 
will be a wide diversity in amount of change and this will depend on 
his or her age when tested. As the person becomes older, more and 
more experiences are encountered which will predispose the person to 
like or dislike that activity more than previously depending on the 
experienced reward value. Roughly one-third of the person's change 
in interest will occur prior to graduating from high school, one-third 
during college years (18-22), and one-third during adulthood with a 
great deal of stability after the age of 25 (Johansson and Campbell ,1?>71). 

Although considerable evidence exists Indicating that people vill 
change in tlieir vocational Interest preferences, do occupational group 
characteristics cJjange sufficiently to Invalidate tests constructed 
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table 5 

Means and Standard Deviations. for Males and Females 
on Homogeneous Scales of Kuder 





Students in 


Grades 6- 


-8 


Students 


in Grades 9- 


12 




Males (M^28?) 


. Females (N=433) 


Males (n=471) 


Females {N=691) 


Scales 


Mean 


S.D. 


Mean 


S.D. 


M>2an 


S.D. 


Mean 


S.D. 


Outdoor 


27.45 


8.73 


22.93 


8.20 


24.50. 


<).71 


21. 5C 


8.30 


Kischa leal 


43.13 


10.20 


26.01 


9.24 


44.74 


11.87 


26.31 


C.84 


Coiuputational 


30.03 


8.34 


27.62 


7.80 


32.99 


9 84 


27.46 


0 70 


Sclenti fic 


39.33 


11.54 


29.04 


10.87 


39.01 


11.71 


29.26 


in. 90 


Persuasive 


54.90 


12.01 


49.53 


11.06 


55.60 


13.15 


51.5/ 


12.54 


Artistic 


29.73 


8.82 


34.47 


9.49 


29.13 


9.35 


33.71 


9.86 


LI terary 


29.41 


9.44 


35.71 


9.34 


30.34 


10.23 


34.49 


10. C8 


Musical 


10.67 


7.51 


12.33 


8.17 


11.75 


8.20 


13.53 


7.50 


Social Service 46.85 


12.45 


59.35 


12.64 


44.34. 


13.C1 


60.40 


13.04 


Clerical 


53.52 


10.13 


61 ./b 


12.45 


52.10 


12.91 


61 .04 


15.47 



ERIC 



-41- 

5, 10, 20, 30 years ago? Specifically 1) are there general chanqes 
across various groups of people over time? 2) are these changes 
associated with the measurement of vocational preferences for specific 
occupational groups?, and 3) are these changes on items that currently 
differentiate the sexes? 

Changes in Item Popularity 

To investigate if there were general changes in interests, occupation- 
al samples tested in the 1930s were contrasted v?ith a matching sr.nple fron 
the same occupation tested in the 1960s (Campbell. 1971). Items that showed 
a significant change in the base rate of responding across a majority of 
the occupational samples were identified on both male and female SVIB. 
For example, about 10% of bankers tested in i 1934 responded "Like" to the 
item "College Professor", while about 30?; of bankers tested in 1964 
responded "Like". Approximately one-fifth of the items investigated fhowod 
large shifts in popularity over the 30 year time span. These types of 
itcmr. v/nre used to develop a Cultural Change Scale; the scale correlated 
in the .80s with test year of the sample. Thus a strong relationship 
existed betv/cen year tested and items comprising the Cultural Change 
scale. Investigation of items indicated that recently tested samples had 
a more positive liking for activities that wore extroverted in content, 
and concurrently out-doors and skilled trades activities became less 
popular. Results v/ere similar for both males and females. 

Changes iin Sroup Characteristics 

Since items could be identified that shov/ed significant changes 
in popularity over time, the next question was whether or not these 
changes also v/ere incorporated into occupational and homogeneous 
scales that measured the interest preferences of criterion groups. 
If occupational interest patterns do change with time, then interest 
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Inventorles should be revised continually. To Investigate this 
problen. Campbell (1966) went back to some of the SVIB's orlqinal 
criterion groups and tested the men In the 1960s who were holdinq 
the exact same jobs as t'lose tested by E.K. Strong in the 1930s. 
Four samplings were done: 1) 1965 ninlsters were matched with 
ministers In the same church in 1927; 2) 1964 bankors were matched 
with man In the same bank position In 1931; 3) corporation presidents 
tested In 1965 were matched with presidents of the same coinpany in 
1935; and 4) 1965 school superintendents were matched with super- 
intendents of the same school system in 1030. The follovnnc table 
presents the data of 12 experimental hor^ocieneous scales (Johansson, 
1969,1974) that permitted comparison between the 1930 and 1960 
testings (the SVIB was revised In the middle 19605, precluding an 
exact comparison on all but 12 homogeneous scales). As is apparent 
in To'^lf 6, the w^^ns betvinm thf» tv/o tp«itino^ wr>rr» vfry i;iinn«>r— 
there were greater differences among the different occupations 
than between two samplings of the same occupation. 



Insert Table 6 About Here 



Inspection of SVIB criterion scores for two samplings of each of 
four occupational groups revealed the same results (Campbell ,1906) . 
The strikinp similarity of results for each of the comparisons 
suggested that occupational scales developed in the 1930s were still 
relevant for the 1960s. Other research by Thrush and King (1965) on 
medical students, by Ferguson (1958, 1960) on life insurance salesmen, 
by Campbell (1965,1968) on male and female osychologists, and by 
Mayarazzo, et. al. (1964) on poltcemen and firemen clearly indicated 
that occupational preference data do not show dramatic shifts over time. 
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However, there was a slight trend in the Campbell (1966) study 
for the more recently tested occupations to score higher on more 
recently developed occupational scales which v;ds attributable to 
using an in-general sample that spanned testings over son'* ^0 years 
and this introduced a "time-biasing" factor. The "modernuption- 
factor" Is a vexing problem and it is difficult to detcmiine precisely 
when a criterion or reference sample becomes outdated. Rocauso of 
the costs and tims in developing sainplos, the d>Ua do not v/arrant a 
complete modernization of the inventory every Irn yor.rs. A^'lur fifi.^on 
to tv/enty years, tin? biasing factors nay lead to spuriously inflat.('<! 
results on recently developed scales if scale construction uses an 
earlier aggregated in-genoral samnlo. Also, aftnr twentv vcars. 
explaining to a client that he or she has interests similar to 
people that were tested twenty years ago» will raise an.eycl.row as to 
relevancy. Thus, after 10-13 years, the test publisher probably 
should consider seriously starting to develop the nccossar/ plans and 
developmental efforts for a majorrevision so it could be completed 
within the ensuing 5 years. 

Changes iq t Iale-Female Pi f ferences 

The preceding data showed that even though occupational character is 
tics are fairly stable ever long time spans, there were also 
individual items that did show sinnificant changes in popularity. 
The next question is: Are items that differentiate the sexes also 
changing over time? 

If differences between male and female vocational preferences arc 
diminishing, then at some future time an Interest inventory could 
be developed where differential base rates of responding would not 
be a factor and sex biasing in test development would become a moot 
point. However, if .differences are not disappearing, then differential 
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response rates by gender v^ill have to be always accomnodatod into 
tlio dovolopnontal systom of interest nieasurement. 

Data presented by Cunpbell (107fl) specifically addresses the 
question of chanqing malo-fomale differences within occupations over 
an years. Nino occup.Ui(»iis v/cre Avallnlilo v/itli iidequate sninplos of 
niAlos and fi-iali's who h.ul hton tor, led durinq the 1030? and the 196ns. 
JuiAv 7 ropi»eU the porccnla'io re'.pnn.linq "Lili*" to four it« :ns that 
typiCiilly show iwlo-fw.iU! differmi.i*'^. The first tv«> it.!!., "Deco- 
rate ft roun with flowor-.." .md "Kiffnil.ir hours ff.r woH," ir.inlly aro 
f.ivorv»d by ffnales and th>.' List tuu» "iVpair electrical \/irinn" awl 
"Lxpress judivpnts openly roqiirdlf.r, of what pfoplc say," .ui* fuvorfl 
rioro bv males. 



Insert IMAv. 7 About ll<'rc 



Inspection of the data in Table 7 clearly indicates tlint there 
has not been a decrease in the maqnitude of the differences between 
males and fcmolcs within occupations from 1930 to 1968. Of the 3fi 
contrasts (nine occupations times four items), 11 '^hov/ed smaller male- 
fonale differences over time while 25 showed largttr differences; dif- 
ferentiation in male-female responding appears to have gram larger. 

The above four examples of items were not atypical; interest 
measurement must take into account the differences In ways that males 
and females respond. Until evidence Is o! ..ned that show dranatic 
shifts are currently occurring, waiting for these differences to dis- 
appear would be futile. 
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SUMflARY 

The scope of this paper has been to review the two major interest 
inventories and to explore the coniplexitios of technical aspects in 
development of item sampling, norming, scoring, and reporting of results 
In relation to the differential treatment of sexes. Using the opt rational 
definition of sex bias as any factor that may influence a pei^on or 
others to limit career opportunities solely on the basis of gender, 
the following areas v/sre Investigated and recomnendations made for 
avoiding potential sex bias. 

Item Sampling 

Interest inventories should not have separate forms of the in- 
strument for males and females. Potential sex biasing should be 
eliminated at the item development level by using the same set of 
items for both males and females. Special care has to bo excerciscd 
so that an item does not inherently make it more applicable to either 
gender. For example, items such as "Policeman" and "Policewoman" are 
less desireable than "Police Officer," or "Realtor" would be prefer- 
able to "Real Estate Salesman" or "Real Estate Salesperson." For 
some types of items, reference to both genders may be the only viable 
alternative, such as "Dressmaker/Tailor" or "Waiter/Waitress." A 
reconroended procedure tfor the test developers would be to request the 
U.S.-GSA Civil Rights Office to review item content of new Inventories 
and suggest changes for any Items that appear to be discriminating 
to minority groups. • 

Scale Development 

There are various factors that have to be considered within the 
context of sex bias when scales are constructed for Interest Inventories. 
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There are two main types of scales (externally and internally bas^d 
scales) constructed for interest inventories and ec:ch has its own dis- 
tinctive Issues relating to sex bias. 

Externally-based Scales 

Externally-based scales (occupational or criterion scales) are 
based on item responses of appropriate criterion samples. For example, 
samples of employed artists » mathematicians, and elementary teachers 
are collected and their responses to the inventory arc tabulated. 
A person's score on criterion scales measures the extent of similarity 
with those in a criterion sample. Of immediate concern in the develop- 
ment of criterion scales is the extent and maqnituUe of differences 
between male and female responses and how to accommodate these dif- 
ferences. For example, should males and females be combinr-d in the 
criterion samnle? Snould scores he renortPd snnaratfly fnr n«l^»s «nrt 
females? Is the magnitude of male-female differences diminishing with 
growing awareness that traditional sex roles arc arbitrary? Should 
scores be reported on scales developed on opposite-sex criterion 
groups when appropriate-sex scales are not available? Do the criterion 
samples become out-dated and need periodic revision? 

Empirical evidence Is very consistent In showing that males 
respond differently than females to many items on interest inventories. 
These differences are neither small nor infrequent. Differential 
responses of males and females should not be Ignored; combining responses 
of males and females In criterion samples would decrease the predictive 
and concurrent validity of empirical criterion scales. Furthermore, 
the magnitude of differences between males and females has not sub- 
stantially diminished during the past 30 years. 
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Since these male-female differences appear to be fairly stable 
and large, occupational scales should be developed on male criterion 
samples and female criterion samples. Every effort should be made to 
collect adequate samples of males and females for each occupation 
represented on the inventory's reporting of results. Furtheiinore » in 
development of empirical scales for both sexes, caution should be 
exercised so as to not differentially include items that represent 
non-valid male-fer.;ale differences. 

When an appropriate sex criterion sample is not available, and 
so as to not limit career options, scores shoold be reported for scales 
based on the opposite sex criterion sample but the user should be 
cogently aware of the impact that sexual stereotypes will have on the 

resultant score. 

S^^bility of response nrf»ferenf*»«J also are evidenfted for oc- 
cupational samples. Occupational scales developed in the past still 
show excellent concurrent validity 30 years later. However, when 
new scales are developed for an old inventory, a "modernization" factor 
occurs in the content of the ne«f scales. Because of cost and time ffxtors 
in revising an inventory, empirical data do not warrant a recoimiended 
nor necessary revision every in years. If newly developed scales are 
added to existing sales on an inventory developed 15-20 years earlier, 
this modernization factor is likely to occur. Thus interest inventories 
should be revised at least after 20 years and preferably after 15 years. 
This revision would update the content of items and develop occupational 
scales that are more responsive to future changes In occupational pat- 
terns in existence within society. 
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Internally-based Scales 

Internally-based scales (homogeneous or basic Interest scales) 
are based on interrelationships of items within the inventory. These 
scales measure content areas of interests such as Mechanical interests, 



Art interests. Social Service interests and are more relevant to the 
measurement of avocational types of interests. Because of the nature 
of scale development (item intercorrolatlons) and the relatively 
lesser importance of predictive validity than for criterion scales, 
responses for males and females could be combined into a genertil 
sample for computincj the necessary statistics for construction of the 
scales. Using one sample with equal representation of males and females 
would lessen the impact of potential sex biasing, but differential 
response frequencies of males and females would still have to be a concern. 
The most meaningful solution would be to use the combined general reference 
sample as the norm sample for converting raw scores to standardized 
scores but providing for separate Interpretive norm distributions for 
males and females. Thus, scores would be equivalent but would have 
different Interpretive meanings dependent upon gender. 

Composition of an appropriate norm reference group for inte.nally- 
based scales would depend on the Intended usage of the Inventory and 
judgment of tne test developer. For example. If the inventory Is in- 
tended primarily for the armed services, the norm sample should be com- 
posed of armed service personnel and not junior high school students; or, 
the test developer may deem that a general sample of adult males and 
females would be a more appropriate reference sample than college graduates. 
In any event, reference groups could be composed of equal representation 
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of males and females if results are presented with separate inter- 
pretive band width? that indiciite male-female differences. 

This use of a single norm sample also permits an easy linkage 
to scores of male and female criterion groups on these homogeneous 
scales since the standard scores would be equivalent in meaning. 
Updating of norm reference samples should be as frequent as that dis- 
cussed previously for updating of occupational samples. 

Socialization Bias 

Many of the technical difficulties outlined above are a result 
of the substantial sex differences to many interest Inventory items. 
The possibility of developing an effective inventory free of such 
bias seems remote. The SVIB has undergone numerous studies over 40 
years to increase its validity, and major revisions in the last 10 
vears have eliminated less valid items, archaic items, and itoms that 
Inherently favor one gender. The result will be a single inventory to 
be released in 1974 where considerable attention has been given to 
sex biasing factors. Still over one-half of the items show significant 
male-female differences. Eliminating items that are not free of 
socialization bias would seriously decrease the validity and viability 
of the instrument. Items such as "Artist" and "Decorating a room with 
flowers" show considerable male-female differences but are very valid 
types of items in differentiating interest of interior decorators 
from other occupations. Data indicate that changes v/lthin society will 
have to be more dramatic than has occurred during the last 30 years 
in reducing male-female stereotypes before effective inventories free 
of socialization can be constructed. 
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Interpretive Materials 

Any published interest inventory should have a sound and com- 
prehensive test manual following the guidelines of the Anerican Psy- 
chological Association as outlined in Standards for Educational and 
Psychological Tests and Manuals (1966) . In addition to recommended 
standards of describing criterion groups, validity, reliability, and 
so forth, a fairly in-depth discussion should be included in the 
manual and other interprcitve materials elucidating t»ie extent and nature 
of male-female differences on the inventory. 

Test developers ohould outline what procedures were used to 
handle male-female response differences during scale construction, 
norming, and reporting of results for the various typos of scales 
that the Inventory may contain. Specific mention should be made as 
to what efforts were made to assure the applicability of the items to 
both genders. Data should be presented Indicating the extent that scales 
developed on either wale or female criterion groups can be generalized 
or used for the opposite sex. 

Sufficient Time and Money 

If sufficient time and money were available, what directions could 
be taken in the development of a new inventory that would lessen or 
eliminate the socialization effect? It may be possible to write a 
sufficient number of items that would make a viable inventory and still 
show no larse differences between sexes. This would involve an extensive 
developmental effort and continual trying out of items on various groups 
of subjects to ascertain the instrument's validity. 

Still, even though enough Items may be aggregated that are relatively 
free of sex biases, small male-female differences may have a cumulative 
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effect during construction and norming of the resultant scales. 
If sex differences existed in the scales, extensive validity studies 
would have to be done to ineasure the extent of the impact of these 
differences. Also, the data may shov/ that for subjects who have well- 
defined and strong career aspirations, male-female differences are a 
minor factor and scales could be developed so as to not have a limiting 
effect on either gender. If results pointed in this direction, use of 
career coimitment may be a more meaningful moderator variable than gender. 

Various statistical procedures could be tried to compensate for 
male-female response differences, such as multivariate discriminant 
analysis, but such procedures would be applying differential weights, 
based on gender, to results and considerable analyses would have to be 
done to show that these procedures do not decrease the validity of the 
inventory for both sexes. 

Ill the long rufi. writing IteiiA thdt cire as vallu as current Hems 
and show no male-female differences may be easier than to try to 
statistically correct for these differences on a post. hoc basis. 
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FOOTNOTE— 

Hhe author Is deeply Indebted to Dr. David P. Campbal], Director 
of the Center for Interest Measurement Research » University of 
Minnesota, for much of the empirical data contained in this report. 
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